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Abstract.  We  consider  regular  expressions  extended  with  the  interleaving  ope  -tor,  and 
investigate  the  complexity  of  membership  and  inequivalence  problems  for  these  expressions. 
For  expressions  using  the  operators  union,  concatenation,  Kleene  star,  and  interleaving, 
we  show  that  the  inequivalence  problem  (deciding  whef,.  er  two  given  expressions  do  not 
describe  the  same  set  of  words)  is  complete  for  exponential  space.  Without  Kleene  star,  we 
show  that  the  inequivaleiice  problem  is  complete  for  the  class  E?  at  the  second  level  of  the 
polynomial-time  hierarchy.  Certain  cases  of  the  membership  problem  (deciding  whether  a 
given  word  is  in  the  language  described  by  a  given  expression)  are  shown  to  be  N  P-complete. 
Special  cases  of  the  membership  problem  which  can  be  solved  in  polynomial  time  are  also 
discussed. 
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1  Introduction 


There  has  been  considerable  progress  in  classifying  the  the  computational  complexity  of 
decision  problems  involving  “regular-like”  expressions.  Such  expressions  are  similar  to  the 
Kleene  regular  expressions  of  finite-automata  theory,  but  may  contain  operators  on  sets 
of  words  other  than  the  usual  operators  union,  concatenation,  and  star.  Problems  which 
have  been  studied  include  inequivalence,  i.e.,  deciding  whether  two  given  expressions  do  not 
describe  the  same  set  of  words,  and  membership,  i.e.,  deciding  whether  a  given  word  is  in 
the  language  described  by  a  given  expression.  Previous  work  on  this  subject  can  be  found, 
for  example,  in  [FurerSO,  Hunt73,  HRS76,  Stock74,  StM73];  see  also  [AHU74,  HU79].  In 
particular,  we  focus  here  on  the  interleaving  operator.  The  interleaving  of  words  x  and  y, 
denoted  x\y,  is  the  set  of  all  words  of  the  form 

212/122^2 -  ••IfcyA: 

where  A:  >  0,  z  =  X\X2  ■  ■  -  y  =  yiyi  -  ■  -yk,  -T-nd  where  the  words  z,-  and  y,-,  1  <  i  <  i,  can 
be  of  arbitrary  length  (including  the  empty  word). 

The  motivation  to  investigate  the  interleaving  operator  is  twofold.  First,  the  interleav¬ 
ing  operator  can  be  interpreted  as  the  simplest  case  of  the  composition  operator  used  in 
algebraic  approaches  to  modeling  concurrent  computation.  Interleaving  represents  the  case 
where  processes  run  concurrently  in  such  a  fashion  that  their  atomic  steps  can  be  arbi- 
trarly  interleaved  but  where  no  communication  between  them  takes  place.  One  of  the  best 
known  formalisms  for  specifying  and  verifying  concurrent  systems  is  CCS  (.^e  [MilnSOj). 
In  (Miln84]  a  restricted  set  of  algebraic  operators  (i.e.  {-,11, «})  is  used  to  form  the  star 
expressions  in  CCS.  These  expressions  are  syntactically  identical  to  regular  expressions, 
but  instead  of  having  as  semantics  “sets  of  strings”,  their  ssHiantics  is  “equivalence  classes 
of  processes”.  In  [KS90]  it  is  shown  that  the  observational  equivalence  problem  of  star 
expressions  is  solvable  in  polynomial  time.  VVe  believe  that  the  techniques  presented  in  this 
paper  will  be  useful  to  determine  the  complexity  of  the  observational  eqnivalence  problem 
of  star  expressions  extended  by  a  suitably  defined  composition  operator.  This  is  an  open 
question  of  [KS90j. 

Secondly,  as  we  discovered  while  doing  this  work,  the  interleaving  operator  has  some 
interesting  properties  of  its  own:  Succinctness;  The  use  of  the  interleaving  operator  can 
shorten  a  regular  expression  by  an  exponential  amount.  Simulation  of  Integer  Addi¬ 
tion  and  Intersection:  Under  certain  format  restrictions,  addition  of  positive  integers 
and  intersection  of  expressions  can  be  simulated  by  the  use  of  the  interleaving  operator. 
Complexity:  The  inequivalence  problem  for  expressions  with  interleaving,  but  without 
star,  is  one  of  the  few  natural  problems  known  to  be  Eo-complete. 

We  now  outline  the  remainder  of  the  paper.  Definition  are  given  in  Section  2.  In 
Section  3,  we  present  a  language  for  which  a  succinct  expression  with  interleaving  exists  but 
every  regular  expression  is  longer  by  an  exponential  factor.  Section  4  illustrates  the  nature 
:f  the  interleaving  operator  via  the  membership  problem  restricted  to  expressions  containing 
1  constant  number  of  interleavings.  In  Section  5,  we  show  certain  cases  of  the  membership 
problem  to  be  NP-compIete.  One  such  case  is  the  problem  of  determining,  given  words 


z,ui,...,Un  (with  n  variable),  whether  z  can  be  written  as  an  interleaving  of  u\,...,Un- 
Sections  6  ,ind  7  are  devoted  to  the  inequivalence  problem  for  expressions  without  and 
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with  trie  Kleene  star,  respectively.  In  the  case  without  star,  we  show  that  interleaving  is 
powerful  enough  to  sunulate  addition  of  integers  under  certain  format  restrictions.  We  can 
then  emulate  a  proof  of  [Stock??]  to  show  that  the  inequivalence  problem  is  D-^-compIete. 
In  the  case  with  star,  we  show'  that  interleaving  can  simulate  intersection,  again  under 
appropriate  format  restrictions.  We  can  then  emulate  a  proof  of  [FurerSO]  to  show  that  the 
inequivalence  problem  is  exponentiai-space-complete. 

2  Definitions 

Basic  familiarity  with  regular  expressions,  time  and  space  complexity,  polynomial-time  re- 
ducibility,  and  complete  problems  is  assumed.  The  necessary  background,  if  needed,  can  be 
found  in  [AHU74]  or  [11117.9],  for  example. 

We  now  define  more  precisely  the  types  of  expressions  and  problems  of  interest.  Let  £ 
denote  the  empty  word.  Let  B  be  a  finite  alphabet  and  let  S  be  a  subset  of  the  operators 
{U,  •,»,n,|}.  We  define  tbs  S-expresstons  (over  E)  and  simultaneously  define  the  operator 
L  which  maps  each  5-expression  to  a  subset  of  E*: 

1.  For  every  a  €  EU  {e},  a  is  an  S-expression,  and  L{a)  =  {cr}; 

2.  If  ri  and  7-3  are  5-e.xpressions  and  @  e  5  -  { « },  then  (ri  @  ro)  is  an  S-e.xpression, 
and  L((ri  @  r2))  =  L{t\)  @  L(r2.): 

3.  If  T  is  an.  5-exprc-ssion,  then  (r*)  is  an  S-expression,  and  =  (L(r))*. 

In  2,  the  interleaving  operator  is  extended  to  sets  of  words  in  the  obvious  way,  i.e.,  L\\Li  is 
the  union  of  the  sets  Wi\w2  taken  over  all  wi  €  L\  and  u/2  €  12-  When  writing  expressions 
in  the  text,  ext-aneous  parentheses  are  often  omitted.  .-Mthough  it  is  sometimes  convenient 
;o  use  e  wh-n  writing  expressions,  our  results  do  not  change  if  expressions  cannot  contain 
€. 

Letting  5  be  as  above,  the  problem  .\IEMBER-5  is  the  problem  of  deciding,  given  an 
S-expression  r  a.nd  a  word  to  €  E*,  whether  w  G  L(r).  The  problem  INEQ-5  is  the  problem 
of  deciding,  given  two  5-c':pressions  rj  and  whether  L(r-,)  =  L{t2).  The  problem  NEC-5 
is  a  special  case  of  INEQ-S;  here  the  problem  is  to  decide,  for  a  given  r,  whether  i(r)  ^  E*. 

jtoj  denotes  the  length  of  the  word  to,  and  |rj  denotes  the  length  of  the  expression  r. 

It  will  be  useful  to  define  |  also  as  an  operator  on  nondeterministic  finite  automata 
(NFA’s)  M\  and  M2  in  such  away  that  L{M\  [.1/2)  =  MM\)  \  L{M2)-  Here  is  the  relevant 
definition  (see  'Eilen74]); 

Let  Ml  =  (Q(,  2,(f,,?0i)  ■f’j)  (i  =  li2)  be  an  NF.A  with  (in  notation  of  [.A1IU74])  state 
set  Qi,  input  alphabet  E,  transition  function  5,,  initial  state  pon  a-nd  accepting  states 
f,.  Then  M  =  M\  j  M2  is  defined  as  follows: 

M  =  {Q\  X  Q2iE,5, [poi,po2],-f')  where  the  new  transition  relation  6  is  defined  as 
'5((?!i?2jis)  =  (5i(?iic)  X  {?2})U({ffi}  X  (52(o2,a)),  and  the  new  set  of  accepting 
states  F  is  defined  by  F{[qi,q2])  =  Fi{qi)  A  F2{q2)- 

Note  that  the  number  of  states  of  M\  ]  M2  is  the  product  of  the  number  of  states  of  Mi 
and  the  number  of  states  of  M2. 
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3  On  the  Expressiveness  of  Interleaving 

In  this  section,  we  will  show  an  example  in  which  the  use  of  the  interleaving  operator 
shortens  a  regular  expression  by  an  exponential-  amount.  Consider  the  alphabet  Sn  = 
{<7i,(T2,..  .  ,<rn}  and  the  language  of  all  permutations  of  length  n  in  E’,  i.e.,  is  the 
set  of  words  of  length  n  in  which  ea'ch  symbol  cr,  appears  exactly  once.  Obviously,  we  have 
L„  =  L{a\\(j2\  ■  ■  ■  kn)-  But  as  the  following  proposition  shows,  there  is  no  standard  regular 
expression  of  polynomial  length  denoting  Ln- 

Proposition  3.1  Every  {U,  ■,*  }-expression  r  with  L{r)  =  Ln  has  |r|  = 

Proof;  For  5  C  Sn,  let  the  word  vj(S)  be  the  concatenation  of  the  symbols  in  5  in  order 
of  increasing  index.  Let  S  denote  the  complement  of  S  with  respect  to  S„.  Note  that  for 
any  5,  w{S)w{S)  G  Ln-  We  claim  that  the  number  of  subsets  of  En,  namely  2”,  is  a  lower 
bound  on  the  number  of  states  of  any  NFA  accepting  Ln- 

Assume  that  there  is  an  NFA  M  with  fewer  states.  Then  there  are  two  subsets  S  and 
T  with  S  ^  T  and  a  state  q  such  that  there  is  a  computation  path  of  M  on  input  ri;(5) 
from  the  start  state  to  q,  a  path  on  input  w{T)  from  the  start  state  to  q,  a  path  on  input 
w{S)  from  q  to  an  accepting  state,  and  a  path  on  input  w{T)  from  q  to  an  accepting  state. 
Assuming  (without  loss  of  generality)  that  S  is  not  a  subset  of  T,  there  must  be  a  aj  in  S 
which  is  not  in  T,  so  ffj  is  in  T.  Therefore,  M  accepts  the  word  w{S)vj{T)  which  contains 
two  occurrences  of  cr,.  Thus  any  NFA  accepting  must  have  at  least  2”  states.  Since 
any  regular  expression  r  can  be  converted  to  an  equivalent  NFA  having  (?(|r|)  states,  the 
proposition  follows.  □ 

4  Example:  Interleaving  of  a  Constant  Number  of  Strings 

To  illustrate  the  interleaving-operator  we  show  how  to  answer  the  following  question  in  a 
straightforward  way:  z  G  L(tii  \  U2  vck)  ?i  where  x  is  a  constai.'  ,  i  <  h)  and  z 
are  strings  over  some  alphabet  S,  and  lii,(  =  n  and  \z\  =  kn. 

1.  Construct  the  NFA  jW  for  ui  |  U2  [ . . .  | Uk-  Its  transition  diagram  will  be  an  nxn . .  .xn 
(fc  times)  grid  of  states.  Thus  we  can  think  of  it  as  a  k-dimensional  hypercube  of  side 
n.  Assume  that  the  start  state  is  at  the  “upper  left”  corner  and  the  only  accepting 
state  (Sd)  is  at  the  “lower  right”  corner.  Every  state  has  at  most  k  successors,  each 
of  which  has  one  coordinate  closer  to  s^-  Thus  there  are  0{n^')  states  and  0{n''’) 
transitions.  Note  that  every  path  from  the  starting  state  to  the  accepting  state  has 
length  kn. 

2.  Let  5  be  a  set  of  states.  Simulate  .V/  on  input  z  by  storing  in  5  the  states  which  .M 
can  reach  after  reading  the  prefix  of  z  consumed  so  far.  .After  reading  at  most  kn 
symbols  one  of  the  following  two  conditions  will  become  true:  (i)  S  =  {}  and  thus 
REJECT  or  (ii)  5  =  {sa}  and  .hus  ACCEPT.  Note  that  the  size  of  S  can  be  at  most 
0(n*”‘),  since  there  are  at  most  (9(n*~*)  states  at  distance  I  (J  <  I  <  kn)  from  the 
start  state. 
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It  can  be  easUy  verified  that  the  above  procedure  can  be  carried  out  using  0{ti^)  .ime  and 
0{n^~^)  space  on  a  unit-cost  RAM.  In  [vLN82)  a  dynamic  programming  algorithm  was 
used  to  improve  the  time  performance  to  0(n*/log^^^^"^^n).  This  was  further  improved 
by  (IPC85]  to  0(n''/log*/(''-^^n). 

This  result  is  easily  generalized  to  the  following. 

Theorem  4.1  For  each  constant  k,  the  problem  MEMBER-{\J,-,*,\},  restricted  to  expres¬ 
sions  having  at  most  k  occurrences  of  \,  can  be  solved  in  polynomiaitime. 

Proof:  Given  a  word  z  and  a  {U,  •,  ♦,  |}-expression  r,  there  is  an  NFA  having  0(|r|^)  states 
which  accepts  L{r).  The  NFA  is  constructed  as  in  [HU79,  Chap.  2),  where  interleavings  are 
handled  by  the  construction  described  in  Section  2.  The  NFA  is  then  simulated  on  input  z 
as  described  above.  □ 

5  Instances  of  MEMBER  which  are  ;V*7^-Compiete 

The  problem  MEMBER-{U,  •,  *,n}  is  known  to  be  solvable  in  polynomial  time  (see  the 
solution  to  Problem  3.23  in  [HU79]).  We  show  in  this  section  that  the  problem  MEMBER- 
{U,  •,  +,  n,  1}  is  ^Vr’-complete.  In  fact,  we  will  prove  an  even  stronger  result  by  showing  that 
the  following  problem  SHUFFLE  is  A/’T’-hard: 

An  instance  of  SHUFFLE  consists  of  n  -1-  1  words  z,ui,...  for  some  n,  and  the 
question  is  whether  z  G  L(ui  | . . .  |  ti„). 

(This  is  the  problem  of  the  last  section  where  the  number  of  strings  (i)  can  be  variable.)  We 
also  show  that  MEMBER-{U,  -,0,1}  is  WP-hard  even  if  |  is  used  only  once  in  the  expression. 

Theorem  5.1  MEMBER-{yj,-,‘,n,\}  is  jVV -complete.  The  problem  remains  jW-hard 
even  if  only  {•,  J}  are  used  in  the  expression,  or  if  {U, -,0,  |}  are  used  in  ike  expression  and  | 
appears  only  once,  /l/jo,  these  problems  remain  jVT -hard  if  an  alphabet  E  of  size  3  is  used. 

We  wiH  now  prove  this  theorem  by  a  sequence  of  lemmas. 

Lemma  5.2  SHUFFLE  is  MV-hard. 

Proof:  We  wUl  prove  this  lemma  by  doing  a  reduction  from  the  well  known  /VP-hard 
3-dimensional  matching  problem: 

Given  disjoint  sets  W  =  {w\,xv2,...,U},j},  X  =  {zi,  =2, . . . ,  Y  =  {yi.yz,  -  -  •  .Vq}. 
and  given  a  set  M  C  W  xXxY,  say  M  =  {mj,  m2,  ...,rn-,),  does  M  have  a  matching? 
I.e.,  does  there  e.xist  a  set  .V/'  C  M  in  which  every  element  of  W  \J  X  \jY  appears 
exactly  once? 

Let  c  be  a  symbol  not  in  W Xi  X  \J  Y.  We  will  construct  strings  , . .  ./i;.  over  the 
alphabet  E  =  W  \J  X  U  Y  U  {c},  such  that 

;  G  L{(ii  \  fi2  I  I  Ilk)  iff  ikT  contains  a  matching  M' . 
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We  wiU  use  the  foUowing  notation:  fj{i)  (1  <  j  <  3,  1  <  j  <  k)  denotes  the  index  of  the 
jth  component  of  m;.  Define  n(tuy)  (n(ij),7i(t/j))  to  be  the  total  number  of  occurrences  of 
the  element  Wj  {xj,yj)  in  all  of  the  elements  of  M. 

Now  define: 


r  =  ni\fM2\  ...  I  fik 


Z  =  WiW2  ...  w,  1122  2,2/12/2  ...  ...  2/^'^^«^~' 

(1)  M  contains  a  matching  z  £  L{r): 

Let  M'  be  a  matching.  Let  g{i)  {1  <  i  <  q)  denote  the  tth  element  in  M'.  Since  Af'  is  a 
matching,  there  is  a  way  to  interleave  /^^(i), —  obtain  the  first  4g  symbols  of  z. 

The  rest  of  z  can  be  trivially  obtained. 

(2)  z  G  L{r)  =i>-  M  contains  a  matching: 

The  only  way  to  choose  the  interleaving  to  obtain  the  first  iq  symbols  of  z  is  to  interleave 
q  whole  fi’s.  The  corresponding  elements  of  M  thus  form  a  matching.  □ 

Lemma  5.3  MEMBER-{[J,-,n,\}  is  MV-fiard  even  if\  appears  just  once  in  the  expression. 

Proof:  We  will  prove  this  lemma  by  doing  a  reduction  from  the  well  known  jVV-h^xd 
problem  3SAT.  Assume  that  we  are  given  a  formula  C  =  {ci,C2,...,Cm}  as  a  collection  of 
m  clauses  on  a  finite  set  {vi,V2,...,Vn}  of  variables  such  that  |c;|  =  3  (1  <  i  <  m). 

We  will  use  the  following  notation:  p,-  (1  <  t  <  m)  is  the  set  of  indices  of  the  variables 
appearing  positively  in  c,,  and  Ui  (I  <  i  <  m)  is  the  set  of  indices  of  the  variables  appearing 
negatively  in  a. 

We  construct  a  string  z  and  an  expression  r  over  the  alphabet  E  =  {ui,U2.  •  ■  • ,  Un}  such 
that 


z  6  L(r)  iff  C  is  satisfiable. 

Let  Ci  (1  <  i  <  m)  be  the  regular  expression  defined  as  follows: 

C,=  {J((E-Vk)Uer  U  ((EU£)".(U  U,)-(2U0”). 

Jgp; 

Thus  C,  n  (E  U  £)”  contains  exactly  all  words  of  length  at  most  n  in  which  (1)  at  least  one 
symbol  whose  index  is  in  n,  does  not  appear,  or  in  which  (2)  at  least  one  symbol  whose 
index  is  in  p,  appears.  Now  let  r  be  defined  as: 

.-  =  (C7,nC2n...nC'„,)  |  (Euef 

and  z  as: 


Z  =  VjV2  ...  Vn. 

(1)  C  is  satisfiable  =>  z  G  L(r): 

Let  T  be  a  satisfying  truth  assignment  for  C.  Let  the  partitioning  of  z  be  such  that  the 
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symbol  vj  belongs  to  the  LIJ.3  of  “|”  iff  T(uy)  =  1.  In  other  words  z  =  x\y\ . . .  XkVki  where 
z  =  xiX2...Xk  is  exactly  the  sequence  of  all  variables  true  under  T  in  ascending  order. 
Since  T  is  satisfying,  we  know  that  for  all :  (1  <  i  <  m)  the  word  x  (with  |r|  <  n)  either  (i) 
contains  at  least  one  symbol  with  index  in  pi  or  (ii)  does  not  contain  all  the  symbols  with 
index  in  n;.  From  this  it  easily  follows  that  x  is  an  element  of  every  (7,  (1  <  f  <  m)  and  we 
are  done. 

(2)  z  e  L{t)  C  is  satisfiable: 

Let  the  partitioning  of  z,  by  which  its  membership  in  L(r)  is  shown,  be  z  =  xiyi  ...x^yk- 
Thus  the  word  x  =  2:112  •  •  •  xy,  is  a  member  of  every  C,  (1  <  t  <  m).  Thus  we  can  define  a 
truth  assignment: 

T(uy)  = 

T  obviously  satisfies  every  clause.  □ 

Lemma  5.4  In  Lemmas  5.2  and  5.3  we  can  use  an  alphabet  of  size  3  instead  of  an  alphabet 
of  variable  size. 

Proof:  We  code  all  symbols  involved  in  the  following  way.  Lei  /i  be  a  one-to-one  mapping 
from  E  to  the  positive  integers.  Then  we  can  code  every  c  in  E  as  “#  Cor¬ 

rect  interleavings  are  now  interleavings  in  which  blocks  representing  one  symbol  are  never 
separated.  It  is  easy  to  see  that  correct  interleavings  are  uniquely  readable  and  incorrect 
interleavings  can  be  easily  detected.  □ 

We  now  prove  the  jVT  upper  bound. 

Lemma  5.5  .MEMBER-{U,-,’,n,\}  is  injW. 

Proof:  Let  z  be  a  word  in  E*  and  let  E  be  an  expression  over  E.  We  define  a  “proof” 
that  z  6  L{E)  recursively  as  follows.  First,  if  z  =  e,  then  the  symbol  e  is  a^  proof  of  (z,  E)  if 
£  e  L{E).  In  the  remaining  cases,  we  assume  z-i  e.  (i)  If  z  G  E,  then  z  is  a  proof  of  (z,  z); 
(ii)  if  Pi  is  a  proof  of  (zi,£^i),  P2  is  a  proof  of  (z2,  £’2),  and  z  =  zi  •  Z2,  then  (z,  Fi  •  P2) 
is  a  proof  of  {z,{E\  •  £2)):  (iii)  if  P  is  a  proof  of  (z,  E)  then  P  is  a  proof  of  (z,(£U  E')) 
and  of  (z,(£'  U  E))  for  any  expression  E'l  (iv)  if  Pj  is  a  proof  of  (z,  Ei)  and  P2  is  a  proof 
of  (z,  £2),  then  (z,  Pi  fl  P2)  is  a  proof  of  (z,(£i  H  £2));  (v)  if  Pi  is  a  proof  of  (zi,  £1),  P2 
is  a  proof  of  (z2,  £2),  and  z  G  L^zi  i  Z2),  then  (z,  Pi  |  P2)  is  a  proof  of  (z,(£i  |  £2)):  (vi)  if 
z  =  ziZ2...Zk  for  some  k  >  I  and  words  z,  5=  £  for  1  <  i  <  ^,  and  if  P,  is  a  proof  of  (z,,  £) 

for  1  <  i  <  then  (z,  Pi . Pt)  is  a  proof  of  (z,  (£')). 

Let  Q  be  the  relation  Q(z,  E,  P)  ilt  P  is  a  proof  of  (z,  £).  The  question  “£  G  L{E)V' 
can  be  solved  in  polynomial  time.  Since  also  the  question  “z  G  L(zi  i  Z2)?”  can  be  solved 
in  polynomial  time  (see  Section  4),  it  is  easy  to  see  that  Q  can  be  computed  in  polynomial 
lime.  By  induction  on  the  structure  of  £  it  is  not  hard  to  verify  that,  if  P  is  a  proof  of 
(z,  E)  and  z  =  £,  then  |P|  <  2iz]j£l.  We  illustrate  the  induction  step  for  case  (vi)  (star): 


1  if  Vj  G  X 
0  ilvj  ^x  . 
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ii?i  <  |2|+t+2+x;i/’ii 

1  =  1 
^  k 

<  1-1  -r  A:  +  2  +  ^  2jz,-||£|  by  induction 

t=i 

<  2|zl  T- 2 -r  2|2||£|  since  A:  <  |z|  and  z  =  zi . . .  zjt 

<  2|z|(|i?|  -f  3)  sir.ce  |z|  >  1 
=  2|z||(r)|. 

Now  we  can  write  z  G  iff  {3?  :  Q{z,E,P)).  It  follows  that  the  membership 
problem  belongs  to  //'P.  □ 

Lemmas  5.2  -  5.5  prove  Theorem  5.1 


6  .  Inequivalence  for  Expressions  without  Star 

There  are  few  natural  problems  known  to  be  complete  in  the  class  E?  of  the  polynomial- 
lime  hierarchy  [Stock??].  In  this  section,  we  add  another  problem  to  this  list  by  showing 
that  INEQ-{U, 1}  is  E^-complete.  The  proof  wilt  make  use  of  the  fact  that  interleaving  is 
powerful  enough  to  simulate  addition  of:  positive  integers. 

Theorem- 6.1  lNEQ-{-,Ki,\}  is  Z^-comvlete. 

Proof:  We  will  prove  this  theorem  in  two  parts.  First  that  the  problem  belongs  to  E?, 
and  then  that  it  is  E^-hard.  Both  parts  of  this  proof  are  similar  to  the  proof  that  the 
inequivalence  problem  for  integer  expressions  is  E^-complete  (Stock??). 

Membershiv 

By  induction  on  the  structure  of  E,  it  is  easy  to  show  that,  if  E  is  a  {U,  •, ; [-expression- and 
z  S  L{E)j  then  |z|  <  \E\.  Let  the  notion  of  a  “proof"  and  the  predicate  Q  be  defined  as  in 
the  proof  of  Lemma  5.5.  Then  we  can  write: 

{Eu  S2)sINEQ{U,-.  1}  iff  (=z:(=Fi:Q(r,Ei,Pi))--(5P2:Q(z,E2.P2))). 

Standard  manipulation  of  quantifiers  and  The-.rem  3.1  of  [Stock??]  imply  now  that  INEQ- 
{U,  •,  1}  is  in  E^. 

tlardness 

We  first  show  that,  using  certain  format  requirements,  we  can  simulate  addition  of  positive 
integers  by  interleaving. 

Let  for  1  <  :  <  71  -r  1  and  1  <  c  <  tti,  be  positive  integers.  For  each  k,  let  s*  be 
the  sum  of  for  1  <  i  <  ti  ^  1.  Let  E  be  the  expression: 

E  =  1“'.* -o- -6 l“'-'"-6  I 

I-i,.,  .s.  jc,.,  ...  ja,.„  .j  j 

-  5  -  .6  -b. 


n 


Let  R  be  the  set  of  words  over  alphabet  {1,6}  such  that  every  block  of  consecutive  b’s  has 
length  at  least  n  +  1- 

Lemma  6.2  L{B)f)R  contains  the  single  word  I-*'  -6”’^*  ...  ym 

Proof:  For  a  word  in  L(JS)  the  only  way  to  build  n+1  consecutive  6’s  is  to  first  interleave  all 
leading  I’s  from  all  n+ 1  arguments  of  the  interleaving  operator  (i.e.,  l“'-‘ ,  , . . . ,  l‘*"+'-‘ ), 

and  then  aU  first  6’s  of  the  arguments,  etc.  □ 

We  now  show  the  desired  hardness  result  by  doing  a  reduction  from  (82^  DN’F),  which 
is  showa  to  be  [Stock??,  Wrath??).  An  instance  of  {82  Ci  DNF)  is  a  Boolean 

formula  C?(A’i,A'2)  where  Xj  {j  =  1,2)  is  aset  of  variables  {rji,Zj2 . -jn},  and  where  G 

is  in  disjunctive  normal  form,  i.e.,  G  =  C7i  V  C2  V . . .  V  Cm,  where  each  Cjt  is  a  conjunction  of 
literals;  the  question  is  whether  3Jl’iVA’2(G(A’i,X2)  =  1).  We  can  assume  that  a  variable 
and  its  negation  do  not  both  appear  in  the  same  clause. 

In  order  to  show  the  ES-hardness  of  INEQ-{-,U, )}  we  will  construct  expressions  £«,  82, 
and  A  such  that 


VXi=X2(G(XuX2)  =  0) 

_  iff  _  (1) 

LiEi  u  R)  C  L(E2  U  R). 

Letting  R  be  defined  as  in  Lemma  6.2,  the  expression  R  will  have  the  properties  that 
L(R) n  R=‘d  and  L{Ei)  -  R.C  L(R).  It  is  easy  to  verify  that  those  two  properties  imply 
that 

L{Ex  U  7!)  C  L{E2  U  'R)  iff  L{Ei)  r^RC  £(£2)  n  R. 

Therefore,  to  prove  (1)  it  suffices  to  show 


'’iXxzX2{G{XuX2)  =  0) 

iff 

L{Ex)^.RCL{E2)r\R. 

Let  us  first  define  R.  Since  all  words  in  L{E\)  are  bounded  in  length  by  M  :=  \E\\,  the 
following  will  do: 

A  =  ((hUlUe)'**- lU£)-0'(^'Ufr“*  (I 

Let  (...)  be  II  if  the  expression  in  the  brackets  is  true  and  I  if  it  is  false.  Let  be  the 
opposite. 

The  expression  B\  is  now: 

Bi  =  ([-li  c  Gij  ■  6  •  [sn  c  G2j -5  •••  {zij  c  Cm)  •  6 

S  G\\  •  6  -  (->111  =  G2i  -  6  ■  f-su  6  Cm]  •  5)  j 
interleaved  with  similar  subexpressions  for  ={2,...,zir.  I 
I”-* -6- 1""^ -6  ••• 
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(The  last  subexpression  contains  m  repetitions  of  I""*"-  •  b.) 

Let  F  he  (1  U  1^  U  U . . .  U  1^"). 

The  expression  E2  is: 

E2  =  ((121  €  Cl]  •  6  •  [x2i  G  02]  - b  [x2i  G  Cm]  ■  h 

U  (-221  G  Cl)  •  6  •  [-’221  G  C72]  *  h  •  ■  •  [-221  G  Cm]  ■  0)  1 
interleaved  with  similar  subexpressions  for  S22,---,^2n  | 

F-b-F-b--- F-b. 

If  we  now  restrict  the  words  in  L{E\)  and  £{£2)  to  be  in  R,  we  can  use  Lemma  6.2  to 
conclude  that  all  words  in  L(Ei)  n  R  and  in  £(£2)  r\  R  are  of  the  form 

2/  =  -  b^*^  •  •  6""^*  -  ■  -  •  b^*^. 

It  is  useful  to  write  numerical  expressions  for  the  numbers  Sk,  1  <  i  <  m.  For  words  in 
£{Ei)n  R,  the  express.uns  ate  functions  of  0-i  valued  variables  pi,  for  1  <  :  <  Setting 
pii  =  0  (tesp..  Pi,-  =  1)  means  that  we  choose  the  LHS  (tesp.,  RHS)  of  the  uh  union  in  Ei 
to  produce  the  corresponding  word  in  £{Ei)n  R.  We  also  interpret  as  being  either  1 
or  2  (as  opposed  to  1  or  11).  Now  y  5  L(Ci)n  R  iff  there  are  pi,  G  {0,1}  such  that,  for 
1  <  1:  <  m, 

n  _ _ 

Sk  =  23((1  -  Pl0(sii  €  Ck]  -r  pu[-su  G  Cjkj)  +  (71+  I). 

»=1 

The  numerical  expressions  for  words  in  L(£J2)nil  involve  0-1  valued  variables  p2,  which, 
as  above,  indicate  whether  the  LHS  or  RHS  of  each  union  in  E2  is  used.  These  expressions 
also  involve  variables  /«  for  1  <  •?:  <  m,  where  I  <  fk  <  2.a  for  all  k-,  here  /*  indicates 
which  word  is  taken  from  the  .l:th  occurrence  of  F  in  £2-  Now  y  e  £{£2)^1  R  sit  there  are 
P2i  G  {0,1}  and  fk  G  {1,2,...  ,27i}  such  that,  for  I  <  k  <  m, 

=  ^((1  “  .52.)(x2t  G  Ck]  +  ?2«{”X2'.  G  C-f])  +  fk- 

i=i 

We  now  can,  as  in  (Stock??),  identify  four  facts  about  £;  and  £2-  The  following  termi¬ 
nology  is  used.  If  X  is  a  set  of  variables,  an  X- assignment  is  an  assignment  of  truth  values 
to  the  variables  in  X.  We  say  that  an  A'-assignment  a  kills  the  clause  Ch  if  either  some 
literal  2  appears  in  Ck  5.nd  2  is  assigned  value  false  by  a  or  some  literal  -^2  appears  in  Ck 
and  2  is  assigned  value  true  by  a. 

(a)  For  each  y  G  £(£1)  H  R,  2n  —  I  <  <  Six  +  1  for  I  <  k  <  and  there  is  an 

Aj-assignment  such  that,  for  1  <  .^  <  m,  s:-  =  3:1  -r- 1  Ht  the  assignment  does  not  kill 
Ck.  ' 

(b)  For  each  Aj-assignment  there  is  a  y  G  Z.(iFi)nR  such  that,  for  1  <  .c  <  m,  st  =  3.i-r  1 
iff  the  assignment  does  not  kill  C%. 

(c)  For  each  y  G  £(£2)  n  R  there  is  an  A2-assignmeni  such  that,  for  i  <  i  <  m,  if 
Sk  =  3n  +  1  then  the  assignment  kills  Ck- 
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(d)  Let  A2  be  an  Xa-assignment  and  y  be  a  word  over  {1,6}’  having  the  form  - 

pj  .  jn-i-i  _  gm-h  that  2n  -r  1  <  Sfc  <  3n  -r  1  and  (sjt  =  Gn  -f  1)  =c>-  (A2  kills  Ck)  for 
I  <  h  <711.  Then  y  S  L{E2)  Cl  R. 

The  proofs  of  (a)-(d)  are  not  difficult.  In  each  case,  we  must  draw  a  correspondence 
between  a  truth  assignment  and  a  word  y.  As  just  noted,  each  word  corresponds  to  values 
for  the  0-1  variables  pi,  or  p2,.  The  correspondence  between  these  variables  and  the  Boolean 
variables  in  G  is  that  Pj,  =  1  iff  Sj,  is  assigned  value  true.  We  illustrate  this  for  (a),  leaving 
the  other  cases  to  the  reader. 

Let  y  e  L(Ei)  n  R.  Since  each  c.xpression  [..,]  is  cither  1  or  2,  it  is  obvious  that 
2n  -r  1  <  Sfc  <  3n  -r  1  for  all  A:.  Consider  the  Xi-assignment  obtained  from  y  via  the  pi, 
as  just  described.  Note  that  s*;  =  3n  -r  1  iff  “2”  contributes  to  each  of  the  n  terms  of  the 
sum.  Suppose  that  xu  6  Ck-  Then  [xu  G  Ck]  has  (integer)  value  1.  Therefore,  the  tih  term 
contributes  “2”  to  the  sum  iff  pii  =  1  iff  =1,  is  true.  Similarly,  if  ->11,  £  Ck,  then  the  :ih 
term  contributes  “2”  to  the  sum  iff  p-,,  =  0  iff  xu  is  false.  It  follows  that  st  =  3:1  -r  I  iff  Ck 
is  not  killed. 

.  Remember  that  our  goal  was  to  show 

'^X^=X2iG(Xi,X2)  =  Q) 
iff 

£(£i)ni2C  L{E2)nR. 

Since  G(Xi,  Jj)  =  0  iff  all  clauses  are  killed,  it  is  easy  to  prove  “only  if”  from  (a)  and  (d), 
while  “if"  follows  from  (b)  and  (c).  As  noted  above,  this  proves  (I).  Finally,  from  (i)  we 
have 

SX,v;f2(G(.Yi,.Y2)=l) 

iff 

L{Ei  u  £2  u  1)  =  £(£2  u  li). 


7  Inequivalence  for  Expressions  with  Star 

Let  EXPSP.ACE  denote  the  class  of  decision  problems  solvable  by  deterministic  Turing 
machines  within  space  d~  for  some  constant  d.  The  problem  NEC-{U, -,n}  is  known 
to  be  EXPSP.ACE-complcte.  This  was  first  proved  by  Hunt  Tiunt"3'  who  also  proved 
that  this  problem  requires  space  c''""  for  some  constant  c  >  i.  The  proof  was  simplified 
by  Purer  ’FurerSOl  and  the  lower  bound  was  improved  to  c~.  We  show  in  this  section 
that  EXPSP.\CE-completeness  of  NEC  and  INEQ  holds  also  if  the  intersection  operator  is 
replaced  by  the  interleaving  operator. 


Theorem  7.1  /iV£Q-{-,U,  •,  |}  and  •V£C.{-,u, j}  are  EXPSPACE-cxrr.piete. 


10 


Proof: 

(1)  INEQ-{U,-,»,|}  e  EXPSPACE. 

Given  {U, I}-expressions  Ei  and  E2  of  length  at  most  n,  it  is  easy  to  build  NFA’s 
Ml  and  M2  with  0(2”)  states  which  accept  L{Ei)  and  £(£2),  respectively.  The  product 
construction  of  Section  2  is  used  for  (.  Using  the  simulation  method  described  in  Section  4, 
it  is  easy  to  show  that  equivalence  of  NTA’s  can  be  decided  by  a  nondeterministic  Turing 
machine  within  space  proportional  to  the  size  of  the  NF.A’s  (Thm.  13.14  of  [HU79]  uses  a 
similar  proof). 

(2)  NEC-{U,-,».|}  is  EXPSP.4CE-hard. 

Furer  proves  in  (FurerSOj  the  EXPSP.ACE-hardness  of  NEC-{U,-, -,0}  by  doing  a  generic 
reduction  from  an  exponential-space  Turing  machine.  This  proof  will  serve  as  a  basis  for  our 
proof.  We  will  show  that  by  adding  new  format  requirements  for  words  describing  accepting 
computations  we  can  simulate  the  intersection  operator  by  the  interleaving  operator. 

The  key  in  Furer's  proof  is  that  the  there  is  a  succinct  (i.e.,  its  length  is  Oin))  expression 
with  intersection  r-  which  describes  the  language  Pn  —  {wyxa"  :  to  S  r*,{a/j  =  ti,~(  £  F}, 
where  F  is  a  hnite  alphabet  and  where  vi^  denotes  the  reverse  of  vj.  r-  can  be  denned 
inductively  as  follows: 


ro=F 

=  F  ■  Tj '  F  n  U7€r7  ■  *  7* 

Thus  r-.  contains  n  nested  occurrences  of  “H”.  We  now  show  that  we  can  describe  a 
language  similar  to  Pn  by  an  expression  which  contains  n  nested  occurrences  of  “I",  provided 
that  words  are  required  to  have  a  certain  restricted  format.  Let  F  =  'fih  Let  c  be 

a  symbol  not  in  F.  If  u/  =  witt'a... where  roi  6  F  for  1  <  :  <  n,  and  if  k  is  a  positive 
integer,  then 

Also,  =  £.  Letting  A  be  any  langu^e  over  F,  define  .4^''^  =  { £  A). 

Words  having  the  required  format  are  in  the  set  defined  as: 

=  (£••)(<)  =  ^  u  U 


Let  the  expression  s-.  be  defined  inductively  as  follows: 

Jo  =  F  •  c 

1  (LV^r 7 •  c •  (F  •  cj*  ■  7  ■  -)• 

Note  that  the  length  of  j;  is 

We  now  claim  that  those  words  in  S;  which  are  restricted  to  be  in  describe  a 

language  similar  to  the  one  desenbed  by  r,.  This  will  be  proved  in  Lemma  7.4,  following 
two  preliminary  lemmas.  The  firsi  lemma  follows  immediately  from  the  uenniiioa  of  the  S;. 


Lemma  7.2  Lf  to  £  £(j;),  tisa  to 


7j?c  fsT  same  7  S  F  zr.d  y  £  (F  'J  { ^i)*. 


il 


To  state  the  second  lemma,  we  need  a  definition.  If  w  6  (F  U  {c})',  let  A/(?y)  be  the 
maximum  length  of  a  subword  u  ol  w  such  that  u  6  F*  U  {c}*.  Note  that  if  lo  6  L{y\z)y 
then  M(ty)  <  Af(y)  +  M{z). 

Lemma  7.3  If  w  e  L{sj),  then  M{vj)  <  j  +  1. 

Proof:  The  proof  is  by  induction  on  j.  The  bask  y  =  0  is  obvious.  Assume  the  lemma  is 
true  for  some  j.  To  prove  the  induction  step,  let  m  G  L{sj+\).  Then  w  6  L{y  \  z)  for  some 

y  e  •  Sj  ■  and  z  G  i(U76r7  •  c  •  (F  •  c)*  ■  7  •  c). 

By  Lemma  7.2  and  the  induction  hypothesis,  M(y)  <  j  +  1.  It  is  obvious  that  M(z)  =  1. 
So  Af  (lu)  <  j  +  2.  □ 

We  can  now  prove  the  connection  between  L(sy)  and  P,. 

Lemma  7.4  L{sj)  fl 


Proof:  Our  proof  will  be  by  induction  on  j. 

Induction  Basis:  If  j  =  0,  then  L{sq)  n  R^^^  =  P  ■  c  =  (Po)^’^. 

Induction  Hypothesis:  L(sj)n  =  (Py)0+0. 

Induction  Step:  We  want  to  show  that  L{sj+i)C\  It  is  easy  to  see  that 

(Pj+i)('>+^)  C  L{sj+i)C\R^^'^'^\  so  we  only  show  the  opposite  inclusion.  Any  word  in 
is  made  of  “chunks”  consisting  of ;  +  2  identical  symbols  of  F  followed  by  j  +  2  c’s.  Let 
zyz  G  L{S)+i)  n  where  i  is  the  first  chunk,  z  is  the  last  chunk,  and  y  is  all  chunks 

in  between.  Using  Lemma  7.2,  the  first  and  the  last  chunk,  r  and  z,  must  result  from  the 
interleaving  of  and  7  •  c.  Moreover,  .since  the  same  7  must  be  used  for  both  x 

and  2,  we  have  x  =  z.  The  word  y  must  result  from  the  interleaving  of  some  y'  G  L(s;) 
and  (F  -c)*.  From  this  and  Lemma  7.3,  we  can  conclude  that  only  those  words  y'  G  L{S}) 

which  are  also  in  can  be  used  to  form  a  word  in  By  the  induction  hypothesis, 

y'  G  (Pj)^-'"*’^^-  Since  y  is  a  concatenation  of  chunks,  it  follows  that  y  G  Since 

3  =  2,  it  follows  that  xyz  G  (P,+i)^-''^^^  Q 

We  now  describe  the  reduction.  For  simplicity,  we  do  the  reduction  from  a  deterministic 
one-tape  Turing  nachine  M  with  space  bound  2"  -  3.  The  extension  to  general  exponential 
space  bounds  is  straightforward.  Let  M  have  tape  alphabet  T,  state  set  S,  accepting  states 
F,  and  start  state  qo.  Let  x  be  an  input  to  M,  and  let  n  =  |zl  and  m  =  2”  -  1.  Let 

StD  =  TU(S’xT)U  {S}.  .A.n  ID  of  Af  is  a  word  of  length  m  in  (Sid)"  of  the  form  Sw(y,of)v3 

where  uw  G  T';  this  ID  means  that  the  string  uav  is  written  on  the  tape,  and  M  is  in  state 
q  scanning  the  symbol  or.  (This  representation  of  ID’s  is  slightly  different  than  the  one  used 
in  (FurerSO),  but  it  is  convenient  for  our  purposes.) 

As  in  [FurerSO],  we  use  “marked  binary  numbers”  to  index  the  symbols  of  an  ID.  A. 
marked  binary  number  is  a  word  over  the  alphabet  {0,Q,  1,1}  in  the  language  described  by 
the  expression  (OU  l)*iQ*UQ’;  i.e.,  the  rightmost  (lowest  order)  1  is  marked,  as  well  as  all 
O’s  to  the  right  of  this  1;  and  in  the  representation  of  0,  all  O’s  are  marked.  For  integer  j 
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with  0  <  j  <  m,  let  [;]  denote  the  length-n  marked  binary  representation  of  j.  The  marking 
allows  the  successor  relation  to  be  tested  locally  as  follows.  Define  succ(O)  =  succ(Q)  —  {0,1} 
and  succ(l)  =  succ(l)  =  {1,Q}.  If  yn  •••  yi  =  (j)  and  z  is  a  marked  binary  number 

of  length  n,  then  z  =  (j  +  1  mod  2”]  iff  zj  6  succ(yj)  for  1  <  ;  <  n. 

Let  S  =  Sid  U  {0,Q,  1, !,#,&,  c}  and  F  =  £  -  {c}.  The  accepting  computation  of  M 
on  input  x,  provided  that  it  esdsts,  is  represented  by  the  following  word  a  G  S*: 

a  = 


where 


a'  =  &[0]^#[0]&[l]^au(l]&[2)''au(2]&... 

■ . .  H  &  [0]^  #  (0)  &  [1)^  a2.i  (1)  &  • . .  (2) 

■  •  •  [rnf- a2.m  H  ^  (0)^  #  (0)  ^  (l)^  <^3,1  [1]  &  ■  ■  • 


•  •  ■  °‘k,m  [m]  &  [0]^  #  [0]  & 

where  a;  =  a,-,ia,-_2 . .  .a,,^  is  the  ith  ID  in  the  computation  of  M  on  input  x.  (In  [FurerSO], 
the  word  a'  is  used  to  represent  an  accepting  computation.)  We  say  that  a  word  a  G  S*  has 
the  correct  framework  if  a  =  for  some  word  a'  as  in  (2),  where  a,-,!  =  a{^m  =  S  for 

1  <  t  <  1,  but  where  the  symbols  a,,;,  for  1  <  :  <  1  and  1  <  ;  <  m,  can  be  any  symbols  of 
2id- 

We  now  simply  have  to  enumerate  the  mistakes  which  imply  that  a  word  is  not  a 
computation  of  M  on  input  x.  Each  type  of  mistake  is  described  by  an  expression.  Letting 
Bx  be  the  union  of  these  expressions, ‘it  follows  that  L{Ex)  5^  E*  iff  Af  accepts  x.  The 
length  of  Ex  will  be  0{v}).  The  following  enumeration  of  mistakes  was  chosen  to  highlight 
the  more  interesting  and  original  parts  of  the  construction.  For  example,  we  consider  “not 
having  the  correct  framework”  to  be  a  single  type  of  mistake,  even  though  this  could  be 
broken  down  into  several  types  of  lower  level  mistakes. 

0.  The  expression  Eo  describes  all  words  not  in 

1.  When  restricted  to  words  in  the  expression  Bi  describes  all  words  which  do 

not  have  the  correct  framework. 

2.  When  restricted  to  words  having  the  correct  framework,  E2  describes  all  words  such 
that  a\  is  not  the  initial  ID  of  M  on  input  x  (i.e.,  aj  ^  S{qo,x\)x2  ■  ■ 

where  B  denotes  the  blank  tape  symbol),  or  such  that  no  symbol  of  the  form  (a,  a) 
appears  where  q  is  an  accepting  state. 

3.  When  restricted  to  words  having  the  correct  framework,  E^  describes  all  words  which 

have  a  “computation  error”,  i.e.,  words  such  that  some  2,+-,,;  with  1  <  ;  <  m  does 
not  follow  correctly  from  by  the  transition  rules  of  M. 

We  first  describe  Ez  in  detail,  since  it  is  the  more  interesting  part  of  the  construction. 
Let  /  :  (Eid)^  Sid  be  such  that,  in  any  correct  computation,  a,+i.;  =  /(a,.j_i,a,j,a,,j.j.i) 
for  all  1  <  i  <  /fc  and  1  <  ;  <  m.  All  such  occurrences  can  be  found,  because  is  to 
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the  left  of  ({i]'^)^"’*’^\  ‘‘•nd  there  is  exactly  one  block  of 

#’s  between  them.  Thus  the  relevant  part  of  a  word  representing  an  accepting  computation 
must  look  as  the  following: 


([;■  -  •  ([j  -  •  &("+!)  • 

([j]/l)(n+l)  .  (a, ■__,)("+!)  .  ([j])(''+‘)  ■  . 

Hi  +  -  (fj  +  !])("+')  ■  &(”+’) 

^(n+l) 


Now  we  can  construct  an  expression  similar  to  the  one  in  Lemma  7.4  to  denote  all  wrong 
computation  steps.  Let  /z, be  symbols  in  SiD)  corresponding  to  ai^j,  a^j+i, 

respectively. 

<o(0  =  (r-cr+‘-<-c.((r-(#)),:)-.#.c.((r-{#)).c)- 

<,+!«)  =  (r^+‘  ■  c'+‘  •  1,«)  ■  P-"'  ■  c'-"')  I  (U,sr  7  •  c  •  (r  ■  c)-  ■  7  •  c)- 

As  in  the  proof  of  Lemmas  7. 2-7.4,  the  following  can  be  proved  by  induction  on  j. 

Lemma  7.5  w  6  n  iff  there  exist  words  2  €  P,  u  S  F”'*’',  and  u,y  S 

(r -{#})*,  s^ch  thatw  =  (zu^vij:yz^p*^'>. 


Now  E2  is  the  union,  over  all  €  EiDi  of: 


E’  .  •  E-. 


As  mentioned,  Eq  denotes  the  mistake  of  a  word  not  being  in  VVe  split  Eo 

in  four  categories,  i.e.,  Eo  =  Uj=i  ^Oj-  Eqi  {E02)  takes  care  of  the  case  of  a  block  being 
too  short  (long),  Eqo  describes  the  case  where  not  every  other  block  is  composed  of  c’s,  and 
Eoi  describes  words  which  start  and  end  wrong: 

^01  =  UesCe  U  (E*  ■  (E  -  {cr})))  •  a  •  (a  U  e)-*  •  (e  U  ((E  -  , z.})  •  E’)) 

Eo2  =  Use  S*  ■  .  E- 

Eo3  =  (e  U  (E*  ■  c))  ■  (E  -  {c})  •  E"  ■  (E  -  {c})  •  S* 


Eoi  =  c  ■  E*  U  E'  ■  (E  -  {c}). 

The  e.xpression  E\  which  describes  all  framework  mistakes  is  conceptually  not  difficult, 
since  the  checking  can  all  be  done  “locally”,  i.e.,  the  symbols  to  be  checked  are  within 
distance  0(n“).  This  expression  can  be  based  on  the  ones  given  in  [FurerSO].  For  these 
reasons,  we  do  not  write  Ei  in  detail.  For  illustration,  we  write  an  expression  for  one  type 
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of  framework  error  where  the  marked  binary  numbers  embedded  in  the  computation  a  are 
not  incremented  correctly.  The  relevant  part  of  a  word  having  the  correct  framework  is 

. . .  ([;■])(”+')  ((j  +  1  mod  .... 

V;t  D  =  {0,0,  l,i},  and  let  r'*'  =  r  •  r*.  Recalling  that  we  can  restrict  attention  to  words 
in  the  following  describes  all  “incrementing  mistakes”: 

n— 1 

U  U  E*-(r+-c+-(D+-c+y  ■&+-c+-(i)+-c+y  •(£)-succ(a))"--c-^-S*. 
j=o  csr? 

The  interested  reader  can  easily  complete  the  construction  of  Ei  by  writing  expressions  of 
length  O(n^)  for  the  other  types  of  framework  errors. 

The  construction  of  E2  is  also  straightforward  and  is  left  to  the  reader.  □ 

Since  the  length  of  Ex  is  O(n^),  a  lower  bound  on  space  complexity  follows  by  a  standard 
argument  (e.g.,  pg.  418  in  (AHU74]). 

Corollary  7.6  There  is  a  constant  c  >  1  such  that  no  deterministic  Turing  machine  with 
space  bound  c'^  can  accept  NEC-{U,  ■,*,{}  or  INEQ-{U, 

Note  that  this  lower  bound  (c'^)  does  not  match  the  upper  bound  (d^). 

By  using  a  coding  like  the  one  described  in  the  proof  of  Lemma  5.4,  it  can  be  shown 
that  Theorem  7.1  and  Corollary  7.6  remain  true  for  expressions  over  an  alphabet  of  size  3. 
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